STATISTICS OF GRAVITATIONAL MICROLENSING 
MAGNIFICATION. II. THREE-DIMENSIONAL 
LENS DISTRIBUTION 



Man Hoi Lee 

Department of Physics, Queen's University, Kingston, Ontario K7L 3N6, Canada 

Arif Babul 

Department of Physics, New York University, 4 Washington Place, New York, NY 10003 

Lev Kofman 

Institute for Astronomy, University of Hawaii, 2680 Woodlawn Drive, Honolulu, HI 96822 

and 
Nick Kaiser 

CIAR Cosmology Program, Canadian Institute for Theoretical Astrophysics, University of 
Toronto, 60 St. George Street, Toronto, Ontario M5S 1A7, Canada 

ABSTRACT 

In Paper I we studied the theory of gravitational microlensing for a planar 
distribution of point masses. In this second paper, we extend the analysis to 
a three-dimensional lens distribution. First we study the lensing properties of 
three-dimensional lens distributions by considering in detail the critical curves, 
the caustics, the illumination patterns, and the magnification cross-sections cr(A) 
of multiplane configurations with two, three, and four point masses. For A* point 
masses that are widely separated in Lagrangian space, we find that there are 
~ 2 W * — 1 critical curves in total, but that only ~ A* of these produce prominent 
caustic-induced features at the high magnification end of (J (A). 

In the case of a low optical depth random distribution of point masses, we 
show that the multiplane lens equation near a point mass can be reduced to the 
single plane equation of a point mass perturbed by weak shear. This allows us to 
calculate the caustic-induced feature in the macroimage magnification distribu- 
tion P(A) as a weighted sum of the semi-analytic feature derived in Paper I for 
a planar lens distribution. The resulting semi-analytic caustic-induced feature is 
similar to the feature in the planar case, but it does not have any simple scaling 
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properties, and it is shifted to higher magnification. The semi-analytic distri- 
bution is compared to the results of previous numerical simulations for optical 
depth r ~ 0.1, and they are in better agreement than a similar comparison in the 
planar case. We explain this by estimating the fraction of caustics of individual 
lenses that merge with those of their neighbors. For r = 0.1, the fraction is 
~ 20%, much less than the ~ 55% for the planar case. 

Subject headings: gravitational lensing 

1. INTRODUCTION 

The gravitational lensing effect of an ensemble of point masses has long been a subject 
of investigation (see, e.g., Schneider, Ehlers, & Falco 1992 for a summary of earlier work). 
Most of the studies have focused on the situation where the spatial distribution of the point 
mass lenses is "compact," i.e., the size of the region within which the lenses are distributed is 
much smaller than the other distances (those between the source and the lenses and between 
the observer and the lenses) in the problem. For gravitational lensing calculations, a compact 
distribution of lenses is usually approximated by its projection onto a single lens plane. The 
important case of low optical depth, single plane microlensing has been investigated in great 
detail (Kaiser 1992; Mao 1992; Rauch et al. 1992; Kofman et al. 1996, hereafter Paper I; see 
also Schneider et al. 1992 and references therein), and we now understand reasonably well 
the caustic structure, the illumination pattern, the probability distribution of macroimage 
magnification, and the flux variability of macroimages in this case. 

There are, however, situations in which the lensing objects are modeled by an "extended" 
(or three-dimensional) distribution of point masses, i.e., by a point mass distribution that 
is comparable in scale to the other distances in the problem. A well-known example is the 
microlensing of stars in the Large Magellanic Cloud and the Galactic bulge by foreground 
stars and possibly massive compact halo objects (MACHOs) (Alcock et al. 1993; Aubourg 
et al. 1993; Udalski et al. 1993). The theoretical analysis of this problem is relatively simple 
because the optical depth to microlensing is very small (r < 10~ 5 ) and the point mass 
lenses are essentially acting independently (Paczyhski 1986, 1991; Griest 1991). (Note, 
however, that microlensing by physical binary systems is also possible; Mao & Paczyhski 
1991; Udalski et al. 1994; Alard, Mao, & Guibert 1995.) A more complicated problem is 
the possibility that a substantial fraction of the dark matter in the universe is in the form 
of compact objects (e.g., Carr 1994). Then the universe as a whole has a significant optical 
depth to microlensing (Press & Gunn 1973; Dalcanton et al. 1994). This latter problem 
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has been studied by several authors using numerical simulations (Refsdal 1970; Schneider & 
Weiss 1988a,b; Rauch 1991), and it was found that there are qualitative differences between 
two- and three-dimensional microlensing. For example, for r ~ 0.1, the three-dimensional 
macroimage magnification distribution does not show the prominent caustic-induced feature 
produced by a planar distribution of point masses (Rauch et al. 1992; but see § 4 below). 

The usual approach to gravitational lensing by a three-dimensional distribution of lenses 
is the multiple lens-plane approximation (Blandford & Narayan 1986; Kovner 1987). The 
properties of multiplane gravitational lensing and, in particular, the reasons for many of the 
differences between two- and three-dimensional microlensing have not been investigated in 
detail, and they are not yet well understood. Besides the statistical microlensing studies 
mentioned above, there have been few studies of the basic mathematical properties of mul- 
tiplane gravitational lensing (Levine & Petters 1993; Levine, Petters, & Wambsganss 1993) 
or the properties of specific multiplane configurations (Kochanek & Apostolakis 1988; Lee 
& Paczyhski 1990; Erdl & Schneider 1993, hereafter ES). This was perhaps due to the lack 
of lensing candidates with lenses at different redshifts. However, recent observations of the 
lens systems 2345+007 (Pello et al. 1996), 2016+112 (Lawrence, Neugebauer, & Matthews 

1993) , and 0957+561 (Bernstein, Tyson, & Kochanek 1993; but see Dahle, Maddox, & Lilje 

1994) suggest that these systems may involve lenses at different redshifts. 

As we shall see in § 3, to understand some of the differences between two- and three- 
dimensional microlensing, it is useful to study in detail specific cases of simple three-dimen- 
sional lens configurations. The study by Erdl & Schneider (ES) of the lens system consisting 
of two point masses at different distances from the observer is a step in this direction. They 
considered primarily the critical curves and the caustics and were able to obtain a complete 
classification of their topological properties with respect to the model parameters in this 
particular case. 

The main motivation of this study is to give a systematic theory of gravitational mi- 
crolensing by a random distribution of point masses. In the accompanying Paper I we 
refined the theory of gravitational microlensing for a planar distribution of point masses. In 
particular, we obtained a practical semi-analytic expression for the macroimage magnifica- 
tion distribution, P(A), for low optical depth r, by allowing for the shear perturbations of 
neighboring lenses. In this paper we study the properties of gravitational microlensing by 
a three-dimensional distribution of point masses. We are interested in the critical curves, 
the caustics, the illumination pattern, and especially the macroimage magnification distri- 
bution. We shall not consider other interesting aspects such as time delays and the temporal 
behavior of microlensing events (note, however, that the latter is given by the magnification 
along tracks through the illumination pattern). In § 2 we write down the lens equations. In 
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§ 3 we study in detail multiplane configurations with two, three, and four point masses to 
find out what new features are expected in three-dimensional microlensing. We also explain 
qualitatively why these new features arise. The case of low optical depth three-dimensional 
microlensing is studied in § 4, using an approach similar to that used in Paper I to study 
the caustic-induced feature in P(A) for the planar case. The resulting semi-analytic P(A) 
is compared to the numerical results of Rauch (1991), and the differences between the two- 
and three-dimensional results are discussed. Our conclusions are summarized in § 5. 



2. LENS EQUATIONS 

We begin by establishing the relevant equations for gravitational lensing by a three- 
dimensional lens distribution. As in Paper I, we shall only consider gravitational lensing in 
an Einstein-de Sitter cosmological background weakly perturbed by the gravitational field 
of the lenses. Our notation is identical to that used in Paper I, and the readers are referred 
to § 2 of Paper I for the definitions of terms not explicitly defined here. 

Let us consider an isotropic point source at the origin of our coordinates and an observer 
plane at a comoving distance Xso from the source (see Fig. 1). In the absence of perturbations, 
the observer plane would be uniformly illuminated and a light ray which leaves the source 
with angle 9 = (9 1: 9 2 ) would pierce the observer plane at x = Xso®- This defines our planar 
Lagrangian coordinates x. In the presence of density inhomogeneities, the light ray would 
suffer deflections and pierce the observer plane at the Eulerian coordinates r. 

For a three-dimensional (or extended) lens distribution, gravitational lensing is usu- 
ally studied within the framework of the multiple lens-plane approximation (Blandford & 
Narayan 1986; Kovner 1987). It is assumed that the lens distribution can be approximated 
by a series of N geometrically thin configurations, i.e., lens planes, and that the separations 
between the lens planes are sufficiently large. Then the deflection of a light ray by one of the 
lens planes is not influenced by the others and a light ray is deflected N times on the way 
from the source to the observer (see Fig. 1). After j — 1 deflections, the location at which 
the light ray pierces the j'th lens plane, projected onto the observer plane, is 

j'-i 

x i = x i + J2Pij S i( X i)i (!) 

i=i 

where Sj(cCj) is the deflection by the ith plane (which can be expressed as the gradient with 
respect to x^ of an effective surface gravitational potential on plane i), is a dimensionless 
distance parameter defined by 

a _ XijXso f \ 
AsjAio 
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and Xij is the comoving distance between planes i and j. Note that can take on values 
between (in the limit i = j) and 1 (in the limit % = s or j = o). If we denote the observer 
plane as plane N + 1, then the Lagrangian to Eulerian mapping is 

r = x + s, (3a) 

where 

N 

r = x N+1 , x = x 1 , s = J2 s i( x i)- ( 3b ) 

i=l 

Unlike the single plane lens mapping, this multiplane lens mapping is not in general a 
gradient mapping. 

For the particular case where the lens planes are populated by point masses, 

/ \ XioXso \ sr-^ Xi Xik 

Si(xi) = -4G[ )l^ m ik] i?> ( 4 ) 



where cij is the cosmological scale factor at the redshift of lens plane i and x^ is the location 
of the kth lens (of mass m^) on lens plane i. Note that x^ is the position projected onto 
the observer plane, i.e., Xik = Xso@ik where 0^ is the angular position of the lens from the 
source. Henceforth, we shall assume for simplicity that all the point masses have the same 
mass m. Then 

Si{Xi) = —X 2 E i ^ I _ (5) 

where 

/ AGmXioX s ' 2 



XE,i = (6) 

V a iXsi J 

is the (Lagrangian) Einstein radius of a point mass on the ith lens plane. For a random 
distribution of point masses, the optical depth to microlensing is r = n J2i=i n % x \.ii where 
n i — ( a iXsi/Xso) 2 ^i/ r n and Ej are, respectively, the projected (onto the observer plane) 
surface number density and the physical surface mass density of point masses on lens plane 
i. When we consider the situation where a significant fraction of the dark matter in the 
universe is in the form of cosmologically distributed compact objects, since the mean of the 
density inhomogeneities should be zero, we should compensate the randomly distributed 
point masses on each plane by a negative smooth surface mass density. Then a term of the 
form imiX 2 Ei Xi should be added to the right hand side of equation (Eh. 



3. MULTIPLANE LENSING BY A FEW POINT MASSES 

As we mentioned in § 1, numerical simulations of gravitational microlensing by a three- 
dimensional random distribution of point masses have revealed differences between two- and 
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three-dimensional microlensing (Refsdal 1970; Schneider & Weiss 1988a,b; Rauch 1991). 
To find out and understand qualitatively some of these differences, it is useful to study in 
detail specific cases of simple lens configurations. In this section we study numerically the 
properties of multiplane gravitational lensing by a few point masses. To reduce the number 
of parameters, we restrict our study to cases where the distance between the source and the 
observer is much smaller than the horizon scale, i.e., a « do- 

If there are Ni point masses on lens plane % (we shall denote the total number of point 
masses as iV* = Y^=\ Ni), the sum of the squared (Lagrangian) Einstein radii is 

N 

4 = E^4,- (7) 

i=l 

For the numerical computations, it is convenient to express all positions in units of xe and 
to define a dimensionless mass parameter 

< - ^ (8) 



x 



E 

so that YliLi A^m- = 1. Then the multiplane lens equation with identical point masses 
(eqs. UU and ||) can be written as 

x — x 

x j = x i - 2 AX E i 'V ( 9 ) 

i=l k l x * 



3.1. Configurations with Two Point Masses 

First let us consider in detail configurations with two point masses. Without any loss 
of generality, we can choose a coordinate system such that x n = and x 2 i = (d,0). Since 
we assume that a « a , the two remaining parameters, /3 12 and m' x , are related by m' l = 
1/(2 -Aa). 

The topology of the critical curves (and the caustics) as a function of the parameters 
(d, (5\2, has been classified by ES (see also Schneider & Weiss 1986 for the /?i 2 = case). 
In general, for given /3 12 and m' 1; there are five transitions in the topology of the critical 
curves as d increases from (see Fig. 6 of ES). However, in the limit flu — > (i.e., in the 
limit of two point masses on the same plane), three of the transitions merge at d — and 
the number of transitions is reduced to two. For comparison, we show in Figures 2-7 two 
sets of models with /3 12 = and fl\i — 0-3. (We have also computed models with (3 12 = 0.6, 
but the results are qualitatively similar to those with /3i 2 = 0.3 and are not shown here.) 
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The configurations shown in Figures 2-7 were chosen to sample regions of the parameter 
space with different critical curve topologies. The parameters (/3i 2 , d) and the critical curve 
topology types (in the notation of ES) of these configurations are listed in Table 1. (Note 
that the topology type F is not represented; with m[ = 1/(2 — P12), type F only occurs at 
very small d.) 

In Figures 2 and 5 we show the critical curves of the configurations with (3\2 = and 
0.3 respectively. The corresponding caustics and iso-magnification contours (on the observer 
plane) are shown in the left and right panels of Figures 3 and 6. In Figures 4 and 7 we plot 
the "normalized" differential cross-section <p(A) = a(A)/ao(A), where cr(A) is the differential 
cross-section for magnification A and 

is the differential cross-section of a single isolated point mass lens with Einstein radius xe 
as defined in equation (|7|). As we saw in Paper I (see also § 4 below), the function ip(A) is 
a useful measure of the deviations of the cross-section of a more complex lens configuration 
from that of the point mass lens. The cross-sections and the iso-magnification contours shown 
in these (and all subsequent) Figures were obtained using the ray-shooting method (see, e.g., 
Kayser, Refsdal, & Stabell 1986; Schneider & Weiss 1986, 1988b). Since the magnification 
factor is calculated in this method by collecting rays into a grid of square pixels, it is the 
magnification for an observing device (or equivalently a source) of the size of one pixel. To 
assess the effects of finite pixel size, we performed two sets of calculations, with the linear 
pixel sizes in one set a factor 1.5-2 smaller (see Table 1). The pixel sizes were chosen so that 
all (except the very small) caustics are resolved. We show in Figures 3 and 6 the contours 
from the calculations with the smaller pixels and in Figures 4 and 7 the function <p(A) from 
both sets of calculations. As expected, the finite pixel size introduces an upper limit to the 
magnification factor and produces extra "bumps" in <p(A) (Rauch et al. 1992), but it does 
not have significant effects on the position or the strength of the real caustic-induced features 
in (f(A). 

The P12 = (and m! x = 0.5) models have already been studied in detail by Schneider 
& Weiss (1986); hence, we only give a brief description of these models here. For d > 2, 
the region near each of the point masses is perturbed by the weak shear from the other 
point mass. So we have two critical lines that are slightly flattened ellipses and two caustics 
that have the familiar "astroid" shape (see § 3 and Fig. 2 of Paper I). As d — > 2, the critical 
curves and the caustics become asymmetric (Figs. 2a and 3a). The corresponding differential 
cross-section (Fig. 4a) shows caustic- induced features that are similar to those found in the 
case of a point mass perturbed by shear (see Fig. 3 of Paper I). There is, in addition, a bump 
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(at A ps 2) which is associated with the last iso-magnification contour that encloses both of 
the caustics. As d decreases below 2, the number of critical curves (and caustics) changes 
from two to one to finally three, and the topology of the critical curves (and the caustics) 
changes from A' to B' to finally C (see Table 1). In all cases, there are caustic-induced 
features in the differential cross-section at moderate to high A and the cross-section at small 
A is reduced. Each of these features is associated with either a local minimum or saddle 
point in the magnification pattern or an iso-magnification contour that touches the caustics 
at one or more points. In the latter case, the contours of slightly larger A are broken into 
disconnected shorter segments by the caustics and the loss of the area inside the caustics 
results in a drop in (fi(A). (An example that has been discussed in detail in § 3 of Paper I 
is the feature associated with the contour osculating the astroid-shaped caustic of the point 
mass plus weak shear lens.) 

When the two point masses are not on the same lens plane, there are new features and 
transitions in the critical curves and the caustics (see Figs. 5 and 6). At large Lagrangian 
separation d (Fig. 5 a), besides the two critical curves that are similar to the ones seen in 
the flu = case (see § 4 for an explanation), there is an extra critical curve inside the 
critical curve that encloses point mass 1 (i.e., the point mass closer to the source). This 
critical curve is offset from point mass 1 in the direction opposite point mass 2 and both 
its size and its distance from point mass 1 increase with decreasing d. The corresponding 
caustic is located at a large distance along the positive x-axis (Fig. 6 a). This caustic has the 
(asymmetric) astroid shape of the two caustics that are known from the f3i 2 = case, and it 
increases in size and moves toward the other two caustics as d decreases. The critical curve 
(and caustic) configuration at large d that we have just described is denoted as topology 
type A by ES. Except for the presence of the new critical curve and caustic, the topological 
transitions shown in Figures 5a-c and 6a-c are similar to those shown in Figures 2 and 
3. The remaining transitions do not exist in the single lens plane case. However, as ES 
pointed out, except for the elliptic umbilic transition between C and D, all transitions are 
the beak-to-beak type. 

We can explain the new critical line (and caustic) and its qualitative behaviors if we 
focus on the configurations of topology types A—C. For these configurations, it is clear that 
the new critical line is composed of the set of rays that are strongly deflected by point mass 
1 (on lens plane 1) to the vicinity of point mass 2 (on lens plane 2) and are then infinitely 
magnified by point mass 2. Obviously this process can only occur when the point masses 
are not on the same lens plane. The new critical line is closer to point mass 1 for larger d 
because a light ray must be more strongly deflected by point mass 1 to reach the vicinity of 
point mass 2. 
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In the magnification patterns (Fig. 6), there are local minima and saddle points that are 
associated with the new caustic; there are also iso-magnification contours that touch the new 
caustic at two points (e.g., the second outermost contour in Fig. 6d). These new features 
in the magnification patterns are expected to produce corresponding features in the cross- 
sections. However, as we can see in Figure 7, the "normalized" differential cross-sections 
tp(A) are actually quite similar to those shown in Figure 4. Features associated with the 
new caustic are not noticeable until d <^ 0.6, and even in cases where the new caustic (which 
is transformed at small d) encloses a large area of the observer plane, the new features are 
typically weak and at A £ 2. (They are stronger and at higher A for the fi 12 = 0.6 models.) 

If we examine the magnification patterns (and Fig. 8) more carefully, we see that the 
new caustic is usually located in regions where the background magnification is small. In 
addition, the magnification throughout most of the region inside the new caustic and, in 
particular, the minimum magnification inside the new caustic are only slightly higher than 
the low background magnification. Hence the new features in the magnification patterns and 
the corresponding features in <p(A) have small A values. Next, let us consider specifically the 
feature associated with the iso-magnification contour that touches the new caustic at two 
points (e.g., the second outermost contour in Fig. 6d). This contour is long and, except for 
the distortion near the new caustic, nearly circular. Contours of slightly higher magnification 
are broken by the new caustic into a short arc around one of the cusps and a long arc that 
encloses the other caustic(s). This results in a decrease in the cross-section a {A) because 
an area that would otherwise be enclosed by two of these contours (of magnification A and 
A + dA) is lost to the interior of the caustic. Although the area lost to the caustic can be 
large, it is typically much smaller than the area enclosed by the long (and the short) arcs. 
Consequently, the decrease in tp(A) = a(A)/a (A), which is approximately the fractional 
decrease in cr(A), is small. Similarly, the increase in <p(A) at A = A m[n , where A min is 
the minimum magnification inside the new caustic, is small because A m[n is typically small 
and the area enclosed by the contours of magnification A min and A min + dA outside the new 
caustic (mainly two long arcs) is much larger than the area enclosed by the contour A min + dA 
inside the new caustic. 

The relatively weak influence of the new caustic on the magnification pattern is demon- 
strated more clearly in the sample of magnification profiles plotted in Figure 8. Although 
the magnification must be infinite on the caustic, only a very small region near the caustic 
has A» 1. In fact, the numerically obtained A, which is finite at a caustic because of the 
finite pixel size of our calculations, shows a maximum at the new caustic that is quite small. 
This behavior of the magnification pattern near the new caustic matches the description of 
the "thin caustics" found in the microlensing simulations of Schneider & Weiss (1988b). We 
have therefore found in the simple two-point-mass case a clear demonstration of the occur- 
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rence of thin caustics. As we mentioned earlier, the new caustic consists of light rays that 
are strongly deflected by the first point mass to the vicinity of the second point mass and are 
then infinitely magnified by the second point mass. In the simulations of Schneider & Weiss, 
which have a large number of point masses distributed on several lens planes, a light ray 
can be strongly deflected on more than one lens plane. It is likely that most of the caustics 
that are strongly deflected on one or more lens planes (i.e., the secondary caustics defined 
in § 3.2 below) are thin caustics. This identification for the thin caustics is consistent with 
the conjecture by Schneider & Weiss that the thin caustics are caused by steep deflection 
gradients because the regions (in Lagrangian space) with strong deflections are also regions 
with steep deflection gradients. 



3.2. Configurations with Three and Four Point Masses 

Now let us consider briefly configurations with three and four point masses. Two con- 
figurations with iV* = 3 point masses and one with N* = 4 are shown in Figures 9-11. These 
configurations have one point mass on each lens plane (i.e., N = iV*) and their parameters 
are listed in Table 2. The configuration a with iV* = 3 was chosen because it has the criti- 
cal curve topology of point masses widely separated in Lagrangian space while the smaller 
critical curves are clearly resolved. The other configurations are "typical" configurations 
resulting from randomly distributing the point masses within a cone of constant Lagrangian 
radius. We have (arbitrarily) chosen a cone radius such that a light ray passing within the 
cone would be infinitely magnified if the mass of the lenses were uniformly distributed within 
the cone. 

A striking feature of Figure 9 (and Fig. 5) is the rapid increase in the number of critical 
curves with iV*. For a configuration with iV* point masses well separated in Lagrangian 
space and with each point mass on its own lens plane (i.e., N = TV*), we can calculate the 
number of critical curves by extending the discussion given in § 3.1 for the origin of the new 
critical curve. Light rays from the source can reach the vicinity of the point mass on the 
ith lens plane after being strongly deflected by any combination (or none) of the previous 
% — \ point masses. For each combination, there is a set of rays that are infinitely magnified 
by the ith point mass (and the weak deflections on the remaining planes). Therefore, there 
are 2* _1 critical curves associated with the ith point mass and the total number of critical 
curves is 2^* — 1. The critical curves shown in Figures 5 a and 9 a are consistent with these 
numbers. (The total number of critical curves is smaller for configurations with N < N* and 
approaches iV* in the limit N = 1, i.e., in the single lens plane case.) Hereafter, we shall 
refer to the caustics that are not strongly deflected as the primary caustics and those that 



- 11 - 



are strongly deflected on at least one lens plane as the secondary caustics. 

Although the number of critical curves increases exponentially with iV*, we can see in 
Figure 9 that the size of a critical curve decreases rapidly as the number of strong deflections 
associated with the critical curve (or equivalently the number of critical curves within which 
it is nested) increases. On the other hand, as in the two-point-mass case, a caustic formed 
after a larger number of strong deflections is not necessarily smaller. For example, in Figure 
10c, the caustic located near (—0.3, 1.2) [which corresponds to the critical curve located near 
(0.25, —0.4)] and the caustic with six cusps [which corresponds to the largest critical curve] 
are comparable in size. 

In the two-point-mass case, due to the absence of overlapping or self-intersecting caus- 
tics, a complete classification of the critical curve topology also provides a complete classifi- 
cation of the caustic topology (ES). This is not true when iV* > 2. In particular, there are 
overlaps of caustics that belong to different critical curves. Examples of such overlapping 
caustics are the caustics located near (0.3,0.3) in Figure 10a and those in the lower left 
corner of Figure 10 c. (There are no self-intersecting caustics in the configurations shown in 
Fig. 10, but they may be present in other configurations.) 

Finally, we discuss the normalized differential cross-sections plotted in Figure 11. For 
the "typical" configurations b and c, the qualitative behaviors of <f(A) agree with what one 
might expect from the two-point-mass configurations studied in § 3.1. The cross-section 
at small A is smaller than that for a single point mass. There are prominent features at 
moderate to high A that are associated with the primary caustics. Features associated with 
the secondary caustics are typically weak, but some of them are noticeable at A <^ 2. For the 
configuration a, the features associated with the primary caustics, i.e., those at log A ^ 0.8, 
are similar to the features in Figures 4a and 7a. This is expected since configuration a has the 
critical curve topology of widely separated point masses. However, the features associated 
with the secondary caustics are at unusually high A: the "plateau" between log A ps 0.55 
and 0.8 consists of a series of such features. This is due to the fact that the parameters of 
configuration a were specially chosen so that the smaller critical curves are clearly resolved. 
For these parameters, the caustics are unusually close to one another (see Fig. 10 a and 
notice the change in scale between Figs. 10a and 106). As a result, the secondary caustics 
are located in regions of relatively high background magnification and the associated features 
in <p(A) are at unusually high A. 
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4. THREE-DIMENSIONAL LOW OPTICAL DEPTH MICROLENSING 

The study of simple lens configurations in the previous section shows that microlensing 
by a three-dimensional distribution of point masses is, in general, much more complicated 
than microlensing by lenses confined to a single lens plane. But there are also indications 
that, in the low optical depth limit, some aspects of the three-dimensional problem can be 
reduced to the planar one. For example, the multiplane configurations with point masses 
widely separated in Lagrangian space show primary caustics and corresponding features in 
ip{A) that are similar to those found in the single plane configurations. In § 4.1 we confirm 
this analytically for the limiting case of a point mass lens perturbed by the weak shear from 
the other lenses. Another useful result from the study in § 3 is that the significant features 
in <f (A) at moderate to high A are associated with the primary caustics. Therefore, we can 
neglect the features associated with the secondary caustics when we analyze the macroimage 
magnification distribution, P(A), at high A. In § 4.2 we derive the feature induced by 
the primary caustics at the high magnification end of P(A) for a three-dimensional low 
optical depth lens distribution. The resulting semi-analytic P(A) is compared to previous 
numerical results in § 4.3, and the differences between the two- and three-dimensional results 
are discussed in § 4.4. 



In the low optical depth limit, lensing (other than low magnification events) by a three- 
dimensional distribution of point masses is, as in the planar case, typically dominated by 
a single point mass. Without loss of generality, we shall designate one of the point masses 
residing on the Lth lens plane as the dominant one and locate this point mass at the origin 
of the lens screen. On all of the other screens, the light ray is only weakly perturbed, and 
the perturbations are expanded to the first order in £Cj. Before the light ray passes the Lth 
screen with the dominant lens (i.e., for j < L), the lens mapping (eqs.[[l| and ||) is simplified 
to 



4.1. 



Cross-Section of the Point Mass Lens Plus Weak Shear in 
Three Dimensions 




(11) 



8=1 



where the constant deflection 



a i — x2 E,i 



k 




(12a) 



13 



and the shear matrix 

c _ ™2 1 ( X 1k ~ Uik 2 x ikUik \ / 19 , x 

t >i- x E,i2^T—n\ 2 ? ,2_ T 2 y iZb ) 

k \<Eik\ \ ^^ikijik !Jik x ik J 

(similar to Qi and Sl defined in eq.[13] of Paper I) represent the perturbative influence of 
the lenses on the ith plane. Equation ([11]) can also be written as 

Xj = BjXi + uj, (13) 

where 

Bj=Z+22 PijSiBi, Uj = Pij {<*i + Si U i) , ( 14 ) 

8=1 1=1 

and X is the identity matrix. Once the light ray has traversed the Lth lens screen, the 
location at which the light ray intersects subsequent (i.e., j > L) screens is given by 

xj = BjXi + Uj + f3 Lj C j d L (x L ), (15) 

where 



d L {x L ) = (16) 

\x L r 



denotes the influence of the dominant lens and 



Cj = i+ £ (^r^SiCi. (i7) 



i=L+l 



fa 



Note that, in the absence of the dominant lens, Uj is the path followed by a light ray with 
xi = 0. 

On the observer plane [which is the (N + l)th screen, but we also denote it with the 
subscript "o"], the Eulerian position is given by 

r = B x + u + C d L {x L ). (18) 

If we use the fact that x = x\ = Bj} (xi — ul) and transform the Eulerian variable ac- 
cording to r — ► r' = C^ 1 (r — u Q + B Bj}u L \ the above equation can be cast into the form 



r' = x f + s'(x'), (19a) 

where x' = xl, 

s'(aj') = d L {x') + S' L x', and S' L = C~ x B Bl x - X. (19b) 



-14- 



Although there is only shear on each lens screen, the matrix S' L is in general a combination of 
shear and convergence (i.e., it is symmetric but not traceless). Thus equation (19) is formally 
equivalent to the single plane lens equation for the point mass lens plus shear and smooth 
surface density (compare with eq. [15] of Paper I for the point mass plus shear lens). This is 
a specific example of the "non-linear telescope" (or generalized quadrupole lens) discussed 
by Kovner (1987; see also Schneider et al. 1992). 

We are interested in the case where the shear perturbations on all the lens screens are 
weak. To first order in Si, the expressions for C , Bj, and S' L are 

N j-1 
i=L+l i=l 



and 



§' L ^j^(^A§ t + £ f^^]4 (20b) 



i=l 



XsLXio , 




respectively. Hence S' L is reduced to a shear matrix: S' L ~ S' \xtofI \4>'l) > where S'(xlo) is 
the magnitude of the "effective" shear and 

f(6')-( COS2(l) ' L sin2< M (21) 
1 {<Pl) ~ { sin 2^ -cos20' L J' {2l) 

Then equation (19) reduces to the single plane lens equation for the point mass plus weak 
shear lens (eq. [15] of Paper I). 

In §§ 3.1-2 of Paper I, we studied in detail the properties of single plane lensing by the 
point mass plus weak shear lens. The results are applicable to the mapping from x' to r' 
(eq.[19]). We can then infer the properties of the lens mapping from x to r (eq. ||18|| ) by 
taking into account the effects of the transformations x — ► x' and r — ► r'. First, it is easy 
to see that, to first order in Si, both transformations are a combination of translation and 
shear distortion and are one-to-one. Second, since the deformation tensor 

there is a simple correspondence between the caustics (and the critical curves) of the map- 
pings (jlSf) and (19). In addition, because the determinants \C a \ and \Bt\ are 1 + 0(S' 2 ), 
the magnification at a point r is, to first order in S'(xlo), identical to the "magnifi- 

cation" \dr' /dx'l^ 1 at the corresponding point r'. Similarly, the correction factors for the 
differential cross-section are 1 + 0(S' 2 ). All of the corrections that we have just considered 
are of the same order as the terms already ignored in the weak shear analysis in Paper I. 
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Therefore, the effects of the transformations are negligible, and the properties derived in 
§§ 3.1-2 of Paper I are also applicable to the lens mapping (|i~8|). Briefly, for a point mass lens 
perturbed by weak shear in three dimensions, the critical line is a slightly flattened ellipse 
(of size xe,l)] the caustic has the shape of an astroid, with the four cusp catastrophes (at a 
distance 2xe,lS'(xlo) from the center of the astroid) connected to each other by four fold 
catastrophes (see Fig. 2 of Paper I); the differential cross-section has the scaling property 
(see eq. [23] of Paper I) 

2tix 2 

a(A\S'[ X Lo)) = {A2 _ E ^ /2 v(AS'[ X Lo)), (23) 

where the normalized differential cross-section ip(AS') depends on A and S'(xlo) only through 
the combination AS'; and ip(AS'), which is shown in Figure 3 of Paper I, shows strong 
caustic-induced features at the critical values A\ = 2/3y/3S' and A 2 = 1/5". 

As we mentioned in § 2, when we consider the situation where a significant fraction of the 
dark matter in the universe is in the form of cosmologically distributed compact objects, we 
should compensate the randomly distributed point masses on each lens plane by a negative 
smooth surface mass density. The derivation in this section can be generalized to this case 
if we replace Si in all the equations (except eq. ||12b|| ) by Si + imiX 2 Ei T. The results remain 
valid, but the magnification A should be interpreted as the magnification with respect to an 
"empty beam" (i.e., a light beam that is not affected by the point masses). 



4.2. Three-Dimensional Superposition of Cross-Sections with Distributed 

Shear 

As we discussed earlier, in the case of low optical depth microlensing by a three- 
dimensional distribution of point masses, features associated with the secondary caustics 
are negligible at the high magnification end of the macroimage magnification distribution 
P(A). Therefore, as in the two-dimensional case (see § 3.3 of Paper I), P(A) at high A can 
be approximated as a superposition of the cross-sections of the individual point mass lenses, 
with the shear perturbation due to the other point masses varying from lens to lens. 

Let us assume that the point masses are distributed randomly on N lens planes, each 
with projected (onto the observer plane) surface number density rij. Then P(A) can be 
written as 

P(A) = nJ2p(XLo) dS' P (S'\ X Lo)(T(A\S'), (24) 

L=l J 

where a(A\S') is the cross-section given in equation (^3|), p(S'\xlo) is the probability dis- 
tribution of the effective shear S'(xl ) for a dominant lens at xlo, p(xlo) is the probability 
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that the dominant lens is located on the Lth screen, and n = Y^=i n i is the total projected 
surface number density. Since p(xlo) is simply 

p{XLo) = —, (25) 
n 

we have 

N 

P(A) = J2P(A\ X Lo), (26) 

L=l 

where 

P(A\ X Lo)=n L dS'p(S'\xLo)v(A\S'). (27) 
Jo 

The function P(A\xlo) has the same structure as P(A) of the two-dimensional case (eq.[29] 
of Paper I), and it is the contribution to the macroimage magnification distribution by the 
lenses on the Lth lens screen as the dominant lenses. 

The effective shear S'(xlo) that enters into the cross-section a(A\S') is a weighted su- 
perposition of the shear induced by all except the dominant lens (eq. ||20b|| ). In the Appendix 
we show that 



r\XLo)S' 
[t' 2 (xlo) + S^ 



P(S'\XLo) = r J UL ° ; " (28) 



where 



// \ ( XsiXLo\ 2 ( XsLXio\ 2 /on\ 

r {XLo) = L — — ™i x E,i + L —— nn i x E,i ( 29 ) 

i=l \XsLXio / \XsiXLo / 

is the "effective" optical depth at xlo- [Note that the effective optical depth t'(xlo) is smaller 
than the (usual) optical depth r = Y^x^iX^ i because the extra factors (XsiXLo/XsLXio) 
(for i < L) and (XsLXio/ XsiXLo) (for i > L) are less than unity] Substituting equations (p3|) 
and (p8|) into equation (f2~7|), we find that P(A\xl ) is similar to -P(^4) for a single lens plane 
(compare with eq.[31] of Paper I): 

P(Mxlo) = ^ X ^f /2 fi[Ar'(xLo)), (30) 

where fi(Ar') = J °° dy ip(AT'y)y / \1 + y 2 ) 3 ^ 2 is the function introduced in § 3.3 of Paper I 
to describe the caustic- induced feature in P(A) in the single lens plane case (see Fig. 4 of 
Paper I; also Fig. 12 of this paper). Thus P(A\xlo) exhibits the same caustic- induced bump 
(with a maximum enhancement of 20% at A w 2/r') and the same scaling property (with 
fx being a function of the combination At') as P{A) of the single lens plane case. 

There is, however, a significant difference between the two- and three-dimensional cases. 
To obtain the macroimage magnification distribution in the three-dimensional case, we must 
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sum P(A\xlo) over all the lens planes (eg. [[26 

2im L x 2 E L 



P(A) 



N 



h[Ar'{ X Lo)] 



2t 



where 




f N (A;r) 



f N (A;r) = Y: 



(31) 



(32) 



and r = tvJ2l=i n L x \,L i s the (usual) optical depth. Although we denote the caustic- induced 
feature in the iV-plane Jn{A] t) for simplicity, it should be clear from equation ( p2| ) 

that (A; r) depends on the location and the surface density of the lens planes in a non- 
trivial manner and does not have any simple scaling properties (when N > 1). Note also that 
the differences between fi(Ar) and /W(^4; r) are due to the differences between the optical 
depths t and t'(xlo)- 

To proceed further, we need to assume a model for the distribution of the point mass 
lenses. As in the numerical simulations reported by several authors (Refsdal 1970; Schneider 
& Weiss 1988a,b; Rauch 1991), we consider a lens distribution with constant comoving mass 
density pi. We divide the volume between the source and the observer into N shells of equal 
comoving thickness (A% = Xso/N) and place the point masses in each shell on a lens plane at 
the middle of the shell. In Figure 12 we plot the numerically evaluated fx (A; r) for N — 1, 
2, 4, 8, and 16, and for source redshift z s — ► and = 2. It is clear that /jv has converged by 
N = 16. This agrees with the conclusion from a study of lensing by smoothly variable mass 
distribution that a fully three-dimensional mass distribution is well approximated by as few 
as eight or 16 screens (Lee & Paczyhski 1990). It is also clear that foo{= liniA^oc /jv) is very 
insensitive to the source redshift, at least for z s <^2. 

In the limit iV — > oo, the summations in the expressions for r, t'(xlo), and /at(v4;t) 
are replaced by integrals: 

X (Xso - x) . 



t'(Xlo) 
foo(A;r) 



AnGpial 

T 



Xso 



dx : 



a(x)Xs 



XsL 



dx 



XloX 



o 



Xso , XiXso 

dx 



a(x)XsoXsL 
X) 



+ 



Xs ° , XsL (Xso - X) 



2n 



dx- ( s 

a(X)XsoXLo 



a(x)Xs 



h 



At- 



T 



(33a) 
(33b) 
(33c) 



Equations (33a-b) can be evaluated analytically, and they are particularly simple in the 
limit z s — > (i.e., in the limit a(xso) ~ a>o)'- 

2tt 



-Gpia x 



2 . 

so' 



(34a) 
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(34b) 



rxso 

/oo(A;r) ss 6 / d X 
Jo 




x) 



fi At 



2x(x. 



so 



so 



x) 



(34c) 



Thus the contribution to is dominated by the lens planes at xlo ~ Xso/2 with t'/t <^ 1/2. 
This is consistent with the numerical results shown in Figure 12. The most significant change 
from fi to /oo is the shift in the location of the bump from At ps 2 to At ps 5. Otherwise, the 
peak value (~ 18% enhancement) and the width of are only slightly smaller and wider. 



Several authors (Refsdal 1970; Schneider & Weiss 1988a,b; Rauch 1991) have used nu- 
merical simulations to calculate the macroimage magnification distribution, P(A), produced 
by a three-dimensional distribution of point masses. In the most recent study by Rauch, 
distributions with high resolution in A were obtained from Monte Carlo simulations for three 
cases with a = 0.01, 0.1, and 0.2. The "equivalent a value" is related to the average magni- 
fication (with respect to an empty beam) on the observer plane, A, by A = (1 — cr)~ 2 , i.e., a 
is the optical depth r of a two-dimensional lens distribution with the same A. For a three- 
dimensional lens distribution, a and r are not identical in general, but Aps1 + 2tps1 + 2<t 
in the limit r C 1. 

We wish to compare the semi-analytic P(A) derived in § 4.2 to the distributions obtained 
by Rauch. (We shall focus on his results for an Einstein-de Sitter cosmological background, 
but Rauch also found that the distributions for the same a but different background cos- 
mology are nearly identical.) Since our derivation assumes low optical depth, the best case 
for comparison should be the a = 0.01 case. Unfortunately, even with 5 x 10 5 simulations, 
the Monte Carlo simulations only provide useful information about P(A) for A < 50 (see 
Fig. 2 of Rauch 1991). This is not sufficient for detecting the caustic-induced feature because 
significant deviation from P(A) = 2t/(A 2 — l) 3 / 2 is not expected for A <^ 1/r (ps 100 for 
r ps a — 0.01) (see in Fig. 12). We shall not show the comparison for this case and 
simply note that the numerically obtained P(A) (for A < 50) agrees with the low optical 
depth semi-analytic result, with or without the caustic-induced feature. 

In Figure 13 we show the distributions obtained by Rauch for a — 0.1 and 0.2 (his- 
tograms). The data are those shown in Figures 3 and 4 of Rauch (1991), but they are 
plotted in the form (A 2 — 1) 3 ^ 2 P(A) and we have increased the bin sizes at large A to reduce 
statistical fluctuations. As Rauch et al. (1992) pointed out, the distributions in the three- 
dimensional case do not show the prominent bump at A ^ 2.5 that was found in the planar 



4.3. Comparison with Numerical Simulations 
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case. However, the distributions are not featureless. This can be seen most clearly in the 
a = 0.2 case (note, e.g., the inflection point at log A « 0.6). 

In a plot of (A 2 — 1) 3 ^ 2 P(A), the low optical depth distribution without the caustic- 
induced feature is a horizontal line of amplitude K = 2r, while the semi-analytic distribution 
with the caustic-induced feature is Kf^A^r) (eq.^TJ; recall that is very insensitive to 
the source redshift). Since a — 0.1 and 0.2 are not much less than unity, two corrections 
to the low optical depth semi-analytic results are necessary when we compare them to the 
numerical ones. First, it is no longer accurate to assume that r = a. For these particular 
models, cr = 0.1 and 0.2 correspond to r = 0.108 and 0.233, respectively. Second, the 
normalization K can be significantly different from the low optical depth result 2r. In § 5.1 
of Paper I, we discussed the renormalization of P(A) due to the net convergence of light rays 
and multiple streaming for the single lens plane case. From an analytic result for P(A) in 
the high magnification limit, which is exact even for finite optical depth r (see also Schneider 
1987), we adopted the normalization 

K = 2T (T+7TpV2' (») 
(For t <C 1, A « 1 + 2r and (1 + r 2 )~ 3 / 2 « 1 — 3r 2 /2, and the correction to the normalization 



2t is small.) The analysis that led to equation (J35 ) cannot be easily generalized to the three- 
dimensional case (it would require the statistical properties of the matrix B which depends 
on the shear matrices «Sj non-linearly; see eq. [14]), and an equivalent expression for K for a 
three-dimensional lens distribution is not known. Nevertheless, in the range of r of interest 
here (i.e., for r <^ 0.25), it is likely that equation ([35]) is a good approximation for K in the 



three-dimensional case. In equation (35), the dominant correction to the normalization 2r 
is the factor A. (The correction due to the factor (1 + r 2 )~ 3//2 is second order in r and less 
than 10% for r <^ 0.25.) The factor A arises from converting a probability distribution in 
Lagrangian space to a distribution in Eulerian space (see § 5.1 of Paper I). The conversion 
(and hence the factor A) should remain in any generalization of the analysis to the three- 
dimensional case. Furthermore, as in the planar case (see § 5.1 of Paper I), we can derive 
the factor A using a different argument: in § 4.2 we assumed that the surface density of the 
astroid-shaped caustics on the observer plane is the same as the projected surface density 
of point masses n (see eq. |[24|| ); there is, however, a net convergence of the light rays by the 
overall lens distribution; this increases the density of astroids by a factor A, and introduces 
an additional factor A in the normalization of P(A) in equation fl3~l~|). Therefore, we shall 



adopt equation (^) for the normalization K in the three-dimensional case. 

In Figure 13 we plot K (dashed lines) and Kf^A^r) (solid lines) for a = 0.1 and 
0.2. They should be compared to the numerical results at log A ^ 0.3 (see § 4.5 for a 
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brief discussion of the deviations at small A). As we can see, the adopted normalization 
K is in good agreement with the amplitudes of the numerical results. In addition, the 
numerical results show deviations from the horizontal dashed lines that are similar in shape 
to the semi-analytic caustic-induced feature foo (note, e.g., that the inflection point in the 
numerical result for a = 0.2 coincides with the location ApxI/t where starts to deviate 
significantly from unity). In the a — 0.1 case, there is little, if any, difference between 
the numerical and semi-analytic results; the only possibly statistically significant deviation 
is that the numerical distribution may be higher near log A xs 1. In the a = 0.2 case, 
the difference between the numerical and semi-analytic results is more pronounced, with 
the numerical distribution slightly higher than the semi-analytic distribution over the range 
0.7 < log A < 2. Note, however, that the number of events in the histograms at log A > 1.6 
is small (see the fluctuations amongst the same bins in the a = 0.1 case), and the true 
distribution may not be as flat as the histograms shown. 



4.4. Criterion for Low Optical Depth 

The comparison in § 4.3 shows that, in the case of a three-dimensional lens distribution, 
the function f^, which was derived in the low optical depth limit, provides a reasonably 
good fit to the caustic-induced feature in P(A) for r as large as 0.233. This is very different 
from the two-dimensional case studied in Paper I. For a two-dimensional lens distribution, 
the caustic-induced feature in the numerically obtained P(A) is more prominent than the 
low optical depth semi- analytic feature j\ for r as small as 0.1. This was shown in Figure 6 of 
Paper I where we compared the numerical and semi-analytic distributions for r(= a) =0.1 
and 0.2 in a plot of (A 2 — 1) 3 ^ 2 P(A). The numerical distributions show a clear minimum 
at A m 2.5 and are significantly higher than the semi-analytic results in the range 0.5 ^ 
logA^l. 

For both the two- and three-dimensional lens distributions, the low optical depth anal- 
ysis assumes that the point mass lenses are well separated in Lagrangian space. Then the 
deflection near each point mass is due to that point mass and the shear perturbation from 
the other lenses, and an astroid-shaped (primary) caustic is associated with each point mass. 
The low optical depth analysis is not valid if the lens density is high enough that a significant 
fraction of the lenses have close neighbor (s), and their caustics are complex structures pro- 
duced by the collective effect of two (or more) point masses. In § 5.3 of Paper I, we estimated 
for the two-dimensional case the fraction of point masses whose caustics are not isolated as- 
troids, P na (r), by considering a point mass and its nearest neighbor as a two-point-mass 
lens: P na ,{r) = 1 — exp(— 8r). For r = 0.1, P na = 0.55. This explains the relatively strong 
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contribution to P(A) by the collective effect of two (or more) point masses (which results in 
the enhanced caustic-induced feature at moderate A) for r as small as 0.1. 

To understand the difference between the two- and three-dimensional cases, we need to 
estimate P na {T) for the three-dimensional case. We now extend the method used in Paper I to 
estimate P q &{t) to the three-dimensional case. We begin by considering lensing configurations 
with two point masses on different lens planes. Since our results in § 4.2 and the numerical 
results of Rauch (1991) show that P(A) is not sensitive to either the source redshift or the 
background cosmology, we shall simplify the analysis by assuming that a(xso) ~ clq- Then 
the two point mass configurations of interest are identical to those studied in § 3.1 and are 
specified by two parameters: the dimensionless distance /3y between the lens planes and the 
dimensionless Lagrangian separation d between the lenses [the actual Lagrangian separation 
1S ( x e i+%E j) d', see eq.(0)]. Note that we have changed the notation slightly and the point 
masses are on lens planes i and j instead of lens planes 1 and 2. As we saw in § 3.1, the 
topology of the caustics (and the critical curves) changes with decreasing d from type A to F. 
The only topology with two astroid-shaped primary caustics is type A; the other topology 
types are produced by the strong effect of both point masses (see Fig. 6). In Figure 14 we 
plot the separation dAB at which the topology changes from type A to type B as a function 
of Pij. (The data were obtained using the numerical code that generated the caustics and 
critical curves shown in Figs. 2-10.) The separation dAB decreases with increasing fy. In the 
limit fa = 0, the two point masses are on the same plane and we have dAB — 2 (Schneider 
& Weiss 1986). In the limit /3jj — > 1 (i.e., in the limit i — > s or j — > o; see eq.@), dAB — > 0; 
this is due to the focusing of the light rays by the point mass on lens plane i and the fact 
that xej/xe,i — > 0. 

Let us consider first a simple extension of the method used in Paper I to estimate P na (T)- 
It is convenient to consider the projected distribution (i.e., the distribution in Lagrangian 
space) of the point mass lenses. If we select randomly one of the point masses, the probability 
that the point mass is on lens plane i is n«/n, where n = Y^i=i n i is the total projected 
surface number density of point masses. This point mass has an absolute nearest neighbor 
in Lagrangian space. Since the point masses are distributed randomly, the probability that 
the absolute nearest neighbor is on lens plane j and at a Lagrangian distance less than r is 
(nj/n)[l — exp(— Tmr 2 )]. If we ignore the deflection due the other lenses and consider the 
point mass and its nearest neighbor as a two-point-mass lens, the discussion in the previous 
paragraph tells us that the primary caustic associated with the point mass is not an isolated 
astroid if r < [x 2 Ei + x E jY^ 2 dAB(Pij)- Therefore, an estimate of the fraction of point masses 
whose caustics are not isolated astroids is 

N 

p L,n( t ) = Yl ~ZT {l - exp [-nn (x 2 E)i + x 2 E>j ) c^ B (/%)] } . (36) 
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In the three-dimensional case, since the (Lagrangian) Einstein radius x e ,l increases 
from to oo as xlo increases (see eq. 0), it is possible, e.g., for the primary caustic (and 
critical curve) of a point mass to merge with that of its second nearest neighbor (from plane 
k), but not with that of its absolute nearest neighbor (from plane j), if xe,h 3> x ej- Thus 
the function P^n^t), which takes into account the absolute nearest neighbor only, probably 
underestimates the fraction of point masses whose caustics are not isolated astroids. We 
can derive a better estimate by taking into account the nearest neighbors from all of the 
lens planes. For a randomly selected point mass (on lens plane i with probability nt/n), 
the probability that the nearest neighbor from lens plane j is at La grangian distance greater 
than r is exp(— imjr 2 ). If we ignore the deflection due the other lenses and consider the point 
mass and its nearest neighbor from lens plane j as a two-point-mass lens, the two-point-mass 
lens produces two astroid-shaped primary caustics if r > (x Ei + x^jY^dABiPij)- Thus the 
probability that the primary caustic associated with the point mass is an isolated astroid 
is T\f=i exp[— imj(x E i + x E j)d\ B (Pij)}. Therefore, a better estimate of the fraction of point 
masses whose caustics are not isolated astroids is 

Pna,7v(T) = — i 1 - eX P ^ J2 n J ( X E,i + *Ej) <4b(Aj) 



n 



3=1 



(37) 



As with fw(A;r), although P^ N and P naj Ar are denoted as functions of r, they do 
depend on the location and the surface density of the lens planes. For N = 1, i.e., for a 
single lens plane, both P na ,iv(T) and Pn a Ar( r ) reduce correctly to P na ,(j) = 1 — exp(— 8r). In 
the limit r<l, 

PnMr) - PLAr) - r £ M ^ + ^ d\ B {^). (38) 

In Figure 15 we show -Pn aA r(T) for N = 1, 2, 4, 8, and oo (dashed lines) and P n a,oo(' r ) (solid 
line), using the constant comoving density model adopted in § 4.2. Both P^ aN and P na ,7v 
have converged by iV 16. The difference between -P^oo an d -Pna,oo is small for r ^ 0.1, but 
it increases to 0.10 at r = 0.5. 

It is clear from Figure 15 that P n a,oo( r ) is significantly smaller than P n a( r ) (the highest 
dashed curve in Fig. 15). In the limit r< 1, -Pna,oo(T) ~ Pnaoo( T ) ~ 2.5r while P n a(T) ~ 8r. 
For r = 0.1, P nai<X) = 0.20 and P na = 0.55; even for r = 0.233, P naj00 = 0.40 only. Thus, 
in the three-dimensional case, the fraction of point masses whose (primary) caustics are 
not isolated astroids and the contribution to P(A) by configurations of two (or more) point 
masses are much smaller. This explains why the low optical depth semi-analytic feature foo 
provides a reasonably good fit to the caustic-induced feature in P(A) for r as large as 0.233. 
At sufficiently large r (> 0.35), P na)0 o > 0.5 and P(A) should deviate significantly from the 
semi-analytic result. 
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4.5. Modification of P(A) at Low Magnification 

Finally, we discuss briefly the deviations of P(A) from the semi-analytic distribution 
Kfoo/ {A 2 — l) 3 / 2 at low magnification (see Fig. 13). As in the planar case, the superposition 
of the cross-sections of individual lenses that results in the caustic-induced feature foo is only 
valid for the magnifications seen by the observers close to one of the astroid-shaped primary 
caustics. The semi-analytic distribution also requires modification at low magnification 
as it diverges as (A — 1)~ 3 / 2 at A = 1 and is not normalizable. In the planar case, an 
observer not particularly well aligned with any single lens sees a primary image close to 
the unperturbed position of the source and a large number of faint secondary images (one 
close to each lens) produced by strongly deflected rays. In Paper I, we took into account the 
correlations between the magnifications of the primary and the secondary images and derived 
the modification of P(A) at low magnification. We have not attempted a similar calculation 
in the three-dimensional case because the problem is significantly more complicated. First, 
for an observer that is not particularly close to either a primary or a secondary caustic, 
there are now ~ 2^* secondary images (where A* is the number of lenses) because a light 
ray can reached the observer after being strongly deflected by one or more lenses. Although 
the secondary images that have been strongly deflected by more than one lens are typically 
much fainter than those that have been strongly deflected by only one lens, they are far 
more numerous and we may have to account for them. A more serious problem is the 
presence of weak caustic-induced features at low magnification that are associated with the 
secondary caustics (see § 3). Unlike the features associated with the primary caustics at high 
magnification, there are no simple semi-analytic descriptions of the features associated with 
the secondary caustics. 



5. SUMMARY 

In this paper and Paper I, we have sought to develop a systematic theory of gravitational 
microlensing. We focused on microlensing by a planar distribution of point masses in Paper I. 
In this paper we have studied gravitational microlensing by a three-dimensional distribution 
of point masses, focusing on the nature of the critical curves, the caustics, the illumination 
pattern, and especially the macroimage magnification distribution P(A). In the process, 
we have explained many of the differences between two- and three-dimensional microlensing 
that have been seen in previous numerical simulations. 

In § 3 we studied the lensing properties of three-dimensional lens distributions by con- 
sidering in detail multiplane configurations with two, three, and four point masses. The 
lensing properties of a three-dimensional lens distribution is fundamentally different from 
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that of a planar distribution in that a light ray could have arrived in the vicinity of a lens 
either having suffered only weak deflections along the way from the source, or after having 
been strongly deflected one or more times by the intervening lens. The latter situation can- 
not arise in a purely planar distribution of lenses, and it gives rise to secondary caustics and 
critical lines that have no analogues in the planar case. In addition, in the case of more 
than two point masses, the caustic topology is no longer simply related to the critical curve 
topology because caustics corresponding to different critical lines may overlap each other. 

For iV* point masses that are widely separated in Lagrangian space, there are ~ iV* 
primary critical curves (i.e., critical curves that have not suffered any strong deflections) 
and ~ 2^* — 1 critical curves in total. Features in the "normalized" differential cross- 
section, (p(A), that are associated with the secondary caustics are relatively weak and at 
low magnifications. Significant features occuring at moderate to high magnifications are 
invariably associated with the primary caustics. 

In § 4 we derived a semi-analytic expression for the caustic-induced feature at the high 
magnification end of P(A) for a low optical depth lens distribution. In the low optical depth 
limit, we showed that the multiplane lens equation near a point mass is formally equivalent 
to the single plane equation of a point mass perturbed by shear. Consequently, the (primary) 
caustic has the familiar astroid shape, and the differential cross-section exhibits the same 
scaling behavior found in Paper I (i.e., (p depends on magnification A and the effective shear 
5" through the combination AS'). By modeling the illumination pattern as a superposition 
of the patterns due to individual "point mass plus weak shear" lenses, we found that the 
caustic- induced feature in the macro image magnification distribution P(A) can be written as 
a weighted sum of the semi-analytic feature derived in Paper I for a planar lens distribution 
(eq.|32[). The resulting semi-analytic caustic-induced feature is similar to the feature in the 
planar case, but it is shifted to higher magnification. 

In the moderate to high A regime, our semi-analytic P{A) agrees remarkably well with 
the magnification distribution obtained from numerical simulations by Rauch (1991), even 
for optical depth as high as r ~ 0.2. This is very different from the two-dimensional case 
studied in Paper I. In the two-dimensional case, we have found that the caustic-induced 
feature in P(A) tends to be more pronounced than the low optical depth result, even for 
optical depth as low as r ~ 0.1. We argued that this particular difference between two- 
and three-dimensional microlensing is due to the fact that for a given value of optical depth, 
the fraction of point masses whose (primary) caustics are not simple astroids is significantly 
higher in the planar case than in the three-dimensional case. Finally, we discussed briefly the 
deviations of the numerical P{A) from the semi- analytic distribution at low magnification. 
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A. STATISTICS OF SHEAR FOR A LOW OPTICAL DEPTH 
THREE-DIMENSIONAL LENS DISTRIBUTION 

In this Appendix we evaluate p(«S"| Xlo), the probability distribution of the effective shear 
S'(xlo) at a given xlo, due to a low optical depth three-dimensional distribution of point 
masses. It is assumed that the point masses are distributed randomly on A" lens planes, each 
with projected (onto the observer plane) surface number density rij. Specifically, on each 
lens plane, Ni = nriiRf point masses are distributed randomly within a circle of Lagrangian 
radius R;. 



At a given xlo, the effective shear (in matrix form) is (eq. ||20b|| ) 

S' L = s'( X Lo)f(4>' L ) = £ & + £ (*^4 (Al) 

i=l \XsLXio ) i=L+l \XsiXLo J 



where (eq. ||12b|1 ) 



§ _ x 2 1 ( cos2(p ik sin2(j) ik \ 

1 ~ XE ' l t{ \ sin20 ifc -cos20 ife )' [ ' 

It is convenient to represent the two distinct components of the shear matrix Si as a vector: 

&i — %E,i ^ S ik , (A3) 
k 

where Si k = (cos 2<pi k /\xi k \ 2 , sin 2(pi k /\xi k | 2 ). Then the effective shear S' L can be written as 
an appropriately weighted vector superposition: 

l /„, „, \ N 



5u \ ( XsiXLo\ 2 ( XshXio\ 2 
iXLo) = }^ X E,i 2^ S ik+ }^ X E,i 1^ S ik 

i=l \XsLXio J k i=L+l \XsiXLoJ k 



(A4) 



N 

= X ^« X! s « fc ' 

i=l k 

where we have defined Gi to simplify the notations. 

Since all the are statistically independent and have similar statistical properties, the 
formal expression for the probability distribution of S'(xlo) is 



p(S'\ X Lo) =/■■■/ ft II d 2 s lkP (s lk ) 5 2 (s' -J2 G *H ^) 

J J i=l k V i=l k / 



(A5) 



where S 2 (s) is the two-dimensional delta function. To evaluate equation (A5), it is convenient 
to express the delta function and p(s ik ) as Fourier transforms: 

5 2 (tf =j^y 2 j d 2 ue-^ S '-^ G ^ k ), ( A6 ) 
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p(s ik ) = j d 2 t lk q(t ik ) e~ itik - Sik , (A7) 

where the characteristic function q(t ik ) is (Nityananda & Ostriker 1984) 

TXTl ■ 

q(t ik ) = 1 - -jjr-tik. (A8) 

Substituting equations (A6) and (A7) into equation (A5), we can perform the integration 
with respect to s^: 

J d 2 S tk e -^ k -it ik - Gi U) = 5 2 ^ _ GiU) (AQ) 

Next, we integrate with respect to t ik , grouping together the Ni integrals associated with 
the ith screen: 

/• • •/ n a* 52 (** - = (1 - Ni — ^ niGiu ( ai °) 

in the limit Ni — > 00. Hence, 

p(Sf|XLo) = (2^2 / d'tie^e-*"- 5 ', (All) 
where the "effective" optical depth t'(xlo) is 

r'(xLo) = tt^G^ = ( XsiXl A irniX 2 Et i + ^ ( XslXio \ ( A12 ) 

j i=l \XsLXio J i=L+l \XsiXLo ) 

Performing the integration with respect to u in equation (All), we obtain 

1 r°° 1 
p(S'\xlo) = 7T- / duue- TU J (S'u) 
An Jo 



(A13) 



1 r' 
2^ (r'l + S' 2 f 2 ' 



Since p(S'\xlo) depends only on the magnitude of the effective shear S'(xlo), it is convenient 
to consider the probability distribution for the latter: 

p(S'\ X Lo) = T>{xLo)S ' (A14) 

[r' 2 (xLo) + S' 2 f 2 

We have expressed the probability distribution explicitly as a conditional probability that 
depends on the value of xlo- It should be noted that p{S'\xlo) has the same functional form 
as the shear distribution due to a planar random distribution of point masses (eq. [30] of 
Paper 1). 
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Table I. Configurations with Two Point Masses 



&2 


d 


Topology 


Pixel Sizes 


0.0 


2.4 


A' 


0.020, 0.032 


0.0 


1.0 


B' 


0.016, 0.024 


0.0 


0.6 


a 


0.016, 0.024 


0.3 


2.0 


A 


0.020, 0.040 


0.3 


1.0 


B 


0.018, 0.036 


0.3 


0.6 


C 


0.016, 0.032 


0.3 


0.5 


D 


0.016, 0.032 


0.3 


0.4 


E 


0.016, 0.032 


0.3 


0.2 


E 


0.020, 0.040 
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Table 2. Configurations with Three and Four Point Masses 



Case 




i 


Xsi/ Xso 




Pixel Sizes 


a 


3 


1 


0.1 


(-0.50,0.00) 


0.010, 0.024 






2 


0.5 


(0.20,0.00) 








3 


0.9 


(0.30,0.25) 




b 


3 


1 


0.55032 


(0.35,0.00) 


0.012, 0.024 






2 


0.79370 


(-0.40,0.65) 








3 


0.94104 


(-0.30,-0.50) 




c 


4 


1 


0.5 


(0.30,0.00) 


0.012, 0.024 






2 


0.72112 


(0.00,0.65) 








3 


0.85499 


(-0.80,0.00) 








4 


0.95647 


(0.00,-0.50) 
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Fig. 1. — A schematic diagram illustrating our notation for multiplane gravitational lensing. 
A light ray that leaves the source with angle 6 has Lagrangian coordinates x(— Xi). It 
is deflected on the N lens planes and reaches the observer plane at Eulerian coordinates 
r(= x N+1 ). 

Fig. 2. — The critical curves of the configurations with two point masses (m' 1 = 1 — m' 2 = 
l/[2 — Pu])- The dimensionless distance between the lens planes (3i 2 = (i.e., the two 
point masses are on the same plane). Each panel is labeled by the Lagrangian separation, 
d, between the two point masses, whose Lagrangian positions are indicated by the crosses. 

Fig. 3. — The caustics (left panels) and the iso- magnification contours (right panels) of the 
configurations shown in Fig. 2. The contour levels are chosen to highlight interesting features 
in the magnification patterns. 

Fig. 4. — The "normalized" differential cross-sections, tp(A), of the configurations shown in 
Fig. 3. Note that ip(A) = 1 for a single point mass of unit mass. In each panel, the thick 
(thin) line was obtained from the ray-shooting calculation with the smaller (larger) pixel size 
listed in Table 1. 

Fig. 5. — Same as Fig. 2, but for the configurations with /5 i2 = 0.3. 

Fig. 6. — Same as Fig. 3, but for the configurations shown in Fig. 5. Note that there is an 
offset between the left and right panels of (a). 

Fig. 7. — Same as Fig. 4, but for the configurations shown in Fig. 6. 

Fig. 8. — (a) The magnification factor A as a function of the x-coordinate for the magnifi- 
cation pattern shown in the right panel of Fig. 6c and y = 0. (b) The magnification factor 
A as a function of the ^/-coordinate for the magnification pattern shown in the right panel 
of Fig. Qd and x = 2.3. (c) Same as (a), but for Fig. 6e. (d) Same as (b), but for Fig. 6/ and 
x = 2.4. 

Fig. 9. — The critical curves of the configurations (a) a, (b) b, and (c) c of Table 2. The 
Lagrangian positions of the point masses are indicated by the crosses. 

Fig. 10. — The caustics (left panels) and the iso-magnification contours (right panels) of the 
configurations shown in Fig. 9. The contour levels are chosen to highlight interesting features 
in the magnification patterns. 

Fig. 11. — The "normalized" differential cross-sections, tp(A), of the configurations shown in 
Fig. 10. In each panel, the thick (thin) line was obtained from the ray-shooting calculation 
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with the smaller (larger) pixel size listed in Table 2. 

Fig. 12. — The function f^(A]r) (eq.p2|) that describes the caustic-induced feature in the 
macroimage magnification distribution, P(A), at low optical depth r. The solid (dashed) 
lines from left to right show for N = 1, 2, 4, 8, and 16, and for source redshift z s — > 
(z s = 2). A lens distribution with constant comoving mass density is assumed (see § 4.2). 

Fig. 13. — Comparison of the semi-analytic macroimage magnification distribution P(A) = 
Kf OQ (A; t) I (A 2 — l) 3 / 2 (solid lines) with the Monte Carlo results of Rauch (1991; histograms) 
for cr = 0.1 and 0.2 (or optical depth r = 0.108 and 0.233). We adopt equation (j35|) for the 
normalization K (dashed lines). 

Fig. 14. — The Lagrangian separation dAB at which the critical curve (and caustic) topology 
of the two-point- mass lens changes from type A to type B as a function of fy. 

Fig. 15. — The fraction of point masses whose caustics are not isolated astroids as a function 
of the optical depth r. The dashed lines from top to bottom are P^ aN (eq.JJ^j) for N = 1, 2, 
4, 8, and oo, and the solid line is P na ,oo (eq.|j3~?|). A lens distribution with constant comoving 
mass density is assumed (see § 4.2). 
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